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Abstract 

The existence of a polynomial kernel for Odd Cycle Transversal was a notorious open 
problem in parameterized complexity. Recently, this was settled by the present authors (Kratsch 
and Wahlstrom, SODA 2012), with a randomized polynomial kernel for the problem, using 
matroid theory to encode flow questions over a set of terminals in size polynomial in the number 
of terminals (rather than the total graph size, which may be supcrpolynomially larger). 

In the current work we further establish the usefulness of matroid theory to kernelization by 
showing applications of a result on representative sets due to Lovasz (Combinatorial Surveys 
1977) and Marx (TCS 2009). We show how representative sets can be used to give a polynomial 
kernel for the elusive Almost 2-SAT problem (where the task is to remove at most k clauses 
to make a 2-CNF formula satisfiable) , solving a major open problem in kernelization. 

We further apply the representative sets tool to the problem of finding irrelevant vertices 
in graph cut problems, that is, vertices which can be made undeletable without affecting the 
status of the problem. This gives the first significant progress towards a polynomial kernel for 
the Multiway Cut problem; in particular, we get a kernel of 0(k s+1 ) vertices for Multiway 
Cut instances with at most s terminals. 

Both these kernelization results have significant spin-off effects, producing the first polyno- 
mial kernels for a range of related problems. 

More generally, the irrelevant vertex results have implications for covering min-cuts in graphs. 
For a directed graph G = (V, E) and sets S, T C V, let r be the size of a minimum (S, T)-vertex 
cut (which may intersect S and T). We can find a set Z C V of size 0(15*1 • |T| • r) which 
contains a minimum (A, £?)-vertex cut for every A C S, B C T '. Similarly, for an undirected 
graph G = (V,E), a set of terminals X C V, and a constant s, we can find a set Z C V of 
size OdXl^ 1 ) which contains a minimum multiway cut for any partition of X into at most s 
pairwise disjoint subsets (see the paper for a detailed description). Both results are polynomial 
time. We expect this to have further applications; in particular, we get direct, reduction rule- 
based kernelizations for all problems above, in contrast to the indirect compression-based kernel 
previously given for Odd Cycle Transversal. 

All our results are randomized, with failure probabilities which can be made exponentially 
small in n, due to needing a representation of a matroid to apply the representative sets tool. 
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1 Introduction 



Polynomial kernelization is a formalization of the notion of efficient polynomial-time preprocessing, 
or more generally of efficient instance simplification and data reduction. Such reduction steps are 
commonly applied in practice, see, e.g., the well-known CPLEX integer programming package, 
or many state-of-the-art SAT solvers. However, to study this theoretically, one needs a notion 
of the hardness of an instance beyond the instance size, e.g., the length of a certificate |20| or a 
more generic parameter associated with the input (cf. |12[ I33j). Informally, a kernelization is a 
polynomial-time reduction of an input instance, with parameter value k, to an equivalent instance 
of the same problem, the kernel, with total output size bounded as a function of k; a problem 
has a polynomial kernel if the size bound is polynomial in k. This turns out to be a robust and 
interesting notion, and there is much work on both upper and lower bounds for the existence of, or 
best possible size of, a polynomial kernel for various problems; see [H [13] and [31 [HI [TO]. 

Among the problems for which the existence of polynomial kernels is still open, one can identify 
two major groups. The first group is centered around the Almost 2-SAT problem: Given a 
2-CNF formula F and an integer k, can you remove at most k clauses to make F satisfiable (or, 
equivalently, find an assignment under which at most k clauses are not satisfied)? This is a natural, 
expressive problem which (at least for purposes of parameterized complexity and kernelization) 
captures several problems of independent interest. For one thing, it directly expresses Odd Cycle 
Transversal (OCT); the existence of a polynomial kernel for OCT was a long-standing open 
problem, only recently solved by the present authors |25j . Less directly, a polynomial kernel for 
Almost 2-SAT has been shown to imply the same for Vertex Cover Above Matching, Konig 
Vertex Deletion for graphs with perfect matchings, and the RHorn-Backdoor Deletion 
Set problem from practical SAT solving (cf. [11[ I24|). among other problems; see |38 [ I16 |. [32] . We 
add to the list Vertex Cover Above LP, i.e., Vertex Cover parameterized by the size of the 
LP gap. For all of these problems, no polynomial kernel was previously known. 

The second group of open problems represents the class of graph cut problems. This is a wide 
class, where little is known regarding polynomial kernelization; problems for which polynomial ker- 
nelization is open include Directed Feedback Vertex Set (arguably one of the biggest open 
problems in kernelization; see |34| ) . Multiway Cut, and Multicut under various parameteriza- 
tions, as well as Group Feedback Arc/ Vertex Set, which again generalizes OCT. 

In this paper, we show polynomial kernels for Almost 2-SAT, and for a collection of graph cut 
problems, including Multiway Cut with a constant number of terminals and MULTICUT with a 
constant number of cut requests. We also show results about covering min-cuts and multiway cuts 
through a set of terminals using few vertices, which should be of independent interest. We make 
use of a lemma on representative sets from matroid theory, due to Lovasz |26j and Marx [28]. In 
particular, we show how to apply the lemma in irrelevant vertex arguments, i.e., how to use it to 
find vertices in cut problems which can be made undeletable without affecting the outcome. All 
our results are randomized, with failure probabilities which can be made exponentially small in the 
input size. 

Related work. Almost 2-SAT (also known as Min 2CNF Deletion) was showed to be 
FPT, runtime 0*(15 fe ), by Razgon and O'Sullivan [39]; this has been improved to 0*(9 h ) [38] . 
C*(4 fc ) [8], and C*(2.6181 fc ) [32]. It has an 0( v / logn)-approximation by Agarwal et al. p], and no 
constant factor approximation under the unique games conjecture [23] . 

Graph cut problems have been a catalyst for the development of new techniques in parameterized 
complexity, including the now ubiquitous iterative compression technique [40 [ 118] , the notion of 
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important separators [27] . and the shadow removal technique [29]. Our focus here is on Multiway 
Cut(/c), first showed to be FPT by Marx [27]. The currently fastest algorithm [8], runtime 0*(2 k ), 
uses an LP approach based on work of Guillemot [T7J; we also use some insights of the latter. 

As for polynomial kernelization of graph cut problems, in joint work with Cygan, Pilipczuk, 
and Pilipczuk [7J the present authors show, amongst others, that Multicut(&) and Directed 
2-Multiway Cut(/c) do not admit polynomial kernels unless the polynomial hierarchy collapses. 
Apart from this, and previous work [25] for Odd Cycle Transversal, little is known about ker- 
nels for cut or feedback problems beyond the kernelizations for Feedback Vertex Set, e.g, |44j . 

Matroids have seen little use as tools in parameterized algorithms (though see [30]), and only 
few papers address problems on matroids. However, recent work of Marx [28] on a parameterized 
matroid intersection problem also provides some results that are used in the current paper. Re- 
garding kernelization, to our best knowledge, previous work of the present authors [25] is the first 
and so far only application of matroid theory, using it to encode terminal cut functions of a (large) 
graph into small space. 

Irrelevant vertex arguments are a central part of the Disjoint paths algorithm of Robertson 
and Seymour [JI], which lies behind the celebrated FPT algorithm for testing graph minors. How- 
ever, the arguments used by Robertson and Seymour to locate irrelevant vertices are very different 
from those used in this paper (and the resulting bounds are far from polynomial). 

Our results. We show several applications of the representative sets lemma of Lovasz |26j and 
Marx [28] to polynomial kernelization, producing the first polynomial kernels for a range of impor- 
tant problems. First, we study Digraph Pair Cut, a constrained graph cut problem designed to 
capture (the iterative compression form of) Almost 2- SAT. We show that the representative sets 
lemma can be used to simplify Digraph Pair Cut down to a cut problem involving a polynomial 
number of terminals; from here, we can get a polynomial kernel either by compression methods as 
in [25] or a direct kernel by the cut-covering sets given below. We get the following. 

Theorem 1. Almost 2-SAT with a bound k on the solution size has a polynomial-time randomized 
compression into size 0(k 6 ), with one-sided error probability 0(2~ k ) and false positives only, and 
a randomized kernel with 0(/c 6 ) variables and failure probability 0(2~ n ). 

As mentioned above, this gives the first polynomial kernels for a range of problems. Next, we 
apply the representative sets lemma to the search for irrelevant vertices for graph cut problems. 
This gives two sets of results. The first relates directly to polynomial kernelization. 

Theorem 2. The following kernelizations are possible: Multiway Cut with deletable termi- 
nals^), with 0(k 3 ) vertices; s-Multiway CuT(fc), with 0(k s+1 ) vertices; s-Multicut(A;), with 
0(k^^^ +1 ) vertices; Group Feedback Vertex Set(&), for a group of s elements, with 0(k 2s+2 ) 
vertices. All results are randomized, with failure probability exponentially small in n. 

Finally, as a second set of irrelevant vertex results, we get interesting conclusions about covering 
min-cuts and multiway cuts in graphs. 

Theorem 3. Let G = (V, E) be a digraph and let S,T C V. Let r denote the size of a mini- 
mum (S, T) -vertex cut (which may intersect S andT). There exists a set Z C V , \Z\ = 0(\S\-\T\-r), 
such that for any A C S and B C T, it holds that Z contains a minimum (A, B)-vertex cut. We 
can find such a set in randomized polynomial time with failure probability 0(2~ n ). 
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Theorem 4. Let G = (V, E) be an undirected graph and X C V . For any s, there exists a 
set ZCF, \Z\ = 0{\X\ s+l ), such that for any partition X = {X\, . . . ,X S ) with pairwise disjoint 
subsets of X, it holds that Z contains a minimum multiway cut of X (i.e., a minimum cut C such 
that no pairs of sets Xi,Xj are connected to each other in G — C). We can find such a set in 
randomized polynomial time with failure probability (D(2~ n ). 

Organization. Our paper is organized as follows. Section [2] contains preliminaries, and Sec- 
tion [3] presents the matroid theory tools we use. Section |4] gives the first application, in the form of 
a polynomial kernel for Almost 2-SAT, and Section [5] gives irrelevant vertex-type consequences, 
yielding polynomial kernels for variants of Multiway Cut and the cut-covering sets of Theorems [3] 
and HI Sections [6] and [7] give further consequences of the Almost 2- SAT kernel and irrelevant 
vertex results, Section [8] contains delayed full proofs, and Section [9] concludes the paper. 

2 Preliminaries 

Parameterized complexity and kernelization. A parameterized problem is a language Q C 
X* x N; the second component of instances (x,k) is called the parameter (cf. [12} 133]). A param- 
eterized problem is fixed-parameter tractable (FPT) if there is an algorithm A and a computable 
function /: N — > N such that A decides (x,k) £ Q in time f(k)\x\ c> ^ 1 \ A kernelization of Q is a 
polynomial-time computable mapping K : S* x N — > S* x N: {x, k) \— > (x',k') such that (x,k) £ Q 
if and only if (x' , k') £ Q and with \x'\, k' < h{k) where h is a computable function; h is called the 
size of the kernel and K is a polynomial kernelization if h{k) is polynomially bounded. 

All kernelization results in this paper are randomized, i.e., there is a (small) chance for the 
reduced instance not to be equivalent to the input. In all cases, the failure is either one-sided, with 
false positives only, or occurs with probability exponentially small in the input size. The former 
type of kernels were called coRP-kernels in previous work [25] ; see [25] for a brief discussion on why 
they are compatible with the lower bound framework [3j Q3] , and [201 E] f° r more on randomized 
compression. The latter type is easily seen to be computable in non-uniform polynomial time, 
compatible with the exclusion of non- uniform compression [14] . 

Matroids. A matroid is a pair M = (E,X), where E is the ground set and IC2 E a collection 
of independent sets, such that: (i) £ X; (ii) if l\ C I 2 and 1% £ X, then I\ £ X; and (iii) if I±, I2 £ I 
and I-Z2I > |-^i|) then there exists some x £ (J2\/i) such that IiL){x} £ I. A set / C E is independent 
if / £ X, and dependent otherwise. A set B £ X is a basis of M if no superset of B is independent; 
a matroid may equivalently be defined by its set of bases. For a subset X C E, the rank r(X) of X 
is the largest cardinality of an independent set I C X. The rank of M is r{M) := r(E). 

Let A be a matrix over a field F and E be the set of columns of A. Let X be the set of all 
sets X C E of columns that are linearly independent over F (as vectors). Then (E,X) defines a 
matroid M, and we say that A represents M. A matroid is representable (over a field F) if there 
is a matrix (over F) that represents it. A matroid representable over some field is called linear. In 
this work, we will concern ourselves only with linear matroids. 

Gammoids. Let D = (V, A) be a digraph and let S,T <^V. The set T is linked to S if there 
exist \T\ vertex-disjoint paths from S to T; this allows paths of length zero, e.g., any set is linked 
to itself. Given any digraph D = (V, A) with source vertices S C V , the sets T C V which are 
linked to S in D form a matroid, a so-called gammoid [36] (see also [351 142]): we refer to it as 
(D,S). Marx [28] gave a randomized polynomial-time procedure for finding a representation of 
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a gammoid. The error probability can be made exponentially small in the size of the graph. By 
standard arguments, advice polynomial in n is sufficient to derandomize this step. 

Theorem 5 ([361 EE]). Let D = (V, A) be a directed graph, and let S C V. The subsets T C V 
which are linked to S form the independent sets of a raatroid over V . Furthermore, a representation 
of this matroid can be obtained in randomized polynomial time with one-sided error. 

Throughout the paper, (A, I?)-cuts may intersect A and B, unless otherwise noted. We also 
create sink-only copies of vertices; a sink-only copy of v G V, in D = (V, A), is a parallel copy v' of v 
which retains only the incoming edges of v (in the undirected case all edges are oriented inwards). 
These will be used, effectively, to require two paths to a vertex v G X in a set X linked to S. Note 
that adding v' to the graph has no effect on any independent (linked) set not containing v' . 

3 Tools from matroid theory 

Representative sets. The notion of representative sets plays an essential role in the paper. 

Definition 1. Given a matroid M = (E,I) and a collection S of subsets of E, we say that a 
subcollection S* C S is r-representative for S if the following holds: for every set Y C E of size 
at most r, if there is a set X £ S disjoint from Y with X U Y G I, then there is a set X* G S* 
disjoint from Y with X* UF£l. 

We will use representative set without specifying r to mean an (r(M) — s)-representative set. 
The following result is due to Marx [28], building on Lovasz [26J. 

Lemma 1 ([26, 28 ). Let M be a linear matroid of rank r + s, and let S = {S%, . . . ,S m } be a 
collection of independent sets, each of size s. Lf \S\ > ( 7 ^ s ), then there is a set Si G S such that 
S \ {S{} is r-representative for S. Furthermore, given a representation A of M, we can find such 
a set Si in time (m + H^ll)^ 1 ) (note that terms polynomial in ( r ~^ s ) are bounded by mP^ ). 

Closest cuts and gammoid rank. Our usage of representative sets and Lemma [1] is centered 
around the concept of closest sets, defined as follows. Let D = (V, A) be a digraph, and 5 C V. A 
set X C V is closest to S if X is the unique (S, X)-min-cut (or, if S and X are not disjoint, X \ S 
is the unique (S \ X, X \ 5)-min-cut). If so, we say that X is a closest set. For any set of 
vertices X, the induced closest set C{X) is the unique (S, X)-min-cut which is closest to S; this is 
well-defined by the submodularity of cuts, and can be found in polynomial time. If X is a closest 
set, then C(X) = X. Note that a closest set does not need to be a cut; there may not be any 
vertices except for X which are separated from 5 by X. 

Closest sets are a natural notion, but do not seem to have a fixed name; e.g., they occur in the 
bipedal stage of the MuLTlcUT(fc) algorithm of Marx and Razgon [29] , There are also similarities 
between closest sets and the concept of important separators [27]: for any set X closest to S which 
separates some vertex v ^ X from S, there is a corresponding important separator. However, the 
change of focus from separation to closeness means that the concepts behave differently. 

We make the following observations connecting closest sets to the rank function of a gammoid. 

Proposition 1 (*0). Let D = (V, A) be a digraph with a set of source vertices S C V, and 
let icy. Let D' be the result of adding a sink-only copy x' for every vertex x G X . The following 
hold. 

1 Proofs of statements marked with * are postponed to Section [8] 
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1. The set X is closest to S in D if and only if X + x' is independent in the gammoid (D',S) 
for every x £ X \ S. 

2. Let Xb be a maximal independent subset of X. A vertex v is reachable from S in D — C(X) 
if and only if Xb + v is independent in the gammoid (D' , 5). 

In particular, any (S, T)-cut X which is not a closest set, i.e., such that X + x' is dependent in 
the gammoid (D', S) for some x € X \ S, can be replaced by a different (S, T)-cut Z of at most the 
same size which does not contain x. 

4 Representative sets: A polynomial kernel for Almost 2-SAT 

In this section we show how to obtain polynomial kernels via representative objects. We consider 
a problem that we call Digraph Pair Cut, which captures the iterative compression version of 
Almost 2-SAT. We show a polynomial kernel for Digraph Pair Cut, using representative sets 
and a gammoid representation of graph cuts. Polynomial kernels for Almost 2-SAT and related 
problems follow via kernelization-preserving reductions. 

We now study the Digraph Pair Cut problem, to provide an 0*(2 k ) time algorithm and a 
randomized polynomial kernelization for it. Given a digraph D = (V, A) with a source vertex s, we 
say that a pair p = {u,v}, u,v E V, is reachable in D if both u and v are reachable from s (not 
necessarily via disjoint paths). The problem is then defined as follows. 

Digraph Pair Cut Parameter: k. 

Input: A digraph D = (V, A) with source vertex s £ V, a set of pairs P C (Y), and an integer k. 
Question: Is there a set X C\V \ {s} with \X\ < k such that no pair in P is reachable in D — X? 

Note that Digraph Pair Cut can be seen as a cut-based generalization of Vertex Cover. 
Specifically, if the graph D is an n-point star with s in the center, then the input is equivalent to 
a Vertex Cover instance with one edge for every pair in P. 

Theorem 6 (*). The Digraph Pair Cut problem can be solved in time 0*(2 h ). 

It is easy to see that optimal solutions for Digraph Pair Cut can be assumed to be closest 
to the source vertex s. We show that there exists a representative subset P* of the pairs P, of 
size 0(k 2 ), which determines whether or not any pair from P is reachable from s under some cut X 
closest to s. Given P* , and a gammoid encoding of the cut function between s and the vertices 
occurring in P*, we get a polynomial kernel for Digraph Pair Cut. Note that the bound on \P*\ 
is tight even in the case of Vertex Cover, where 0(k 2 ) edges is optimal [TO] . 

Lemma 2 (*). Let D = (V 7 , A) be a digraph, s £ V, k an integer, and P C ( 2 ) a set of vertex pairs. 
In randomized polynomial time (with failure probability exponentially small in the input size) we 
can find a set of 0(k 2 ) pairs P* C P (the representative pairs,), such that for any set X C V \ {s} 
closest to s of at most k vertices, the graph D — X contains a reachable pair p G P if and only if 
it contains a reachable pair p* £ P* . 

Proof (sketch). Proposition Q] implies that, for any cut X closest to s, a vertex v is reachable from s 
in G — X if and only if X + v is independent in the gammoid (D,S), where S consists of k + 1 
copies of s. 

By studying a matroid M consisting of two disjoint copies of (D,S), we can extend this fact to 
pairs of vertices. We use Lemma [1] to generate a set of representative pairs P* C P of size 0(k 2 ) 
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with respect to independent sets of size at most 2k in M. Now if P contains a pair that extends 
an independent set X' , with \X'\ < 2k, then P* contains such a pair too; the crucial sets X' 
in M are those which correspond to two copies of some closest cut X. This completes the proof 
since, by Proposition [Q reachability with respect to closest cuts is equivalent to extension of the 
corresponding independent set. □ 

We note that a generalization from pairs to g-tuples still holds. If applied to g-tuples, the lemma 
uses polynomial time to reduce a given set of g-tuples to a set of 0(k q ) representative g-tuples. 
This implies a polynomial kernel for the q-aiy generalization of Digraph Pair Cut, where the 
input contains a set of (/-tuples Q, and the task is to separate at least one member of every g-tuple 
from s. We call this variant Digraph q-Tuple Cut. Again, note that a bound of 0(k q ) is tight 
for g-tuples: it is straightforward to reduce from (/-Hitting Set for which total size of 0{k q ~ e ) for 
any e > is excluded unless NP C coNP/poly |10j . 

Now, let [D, s, P, k) be an instance of Digraph Pair Cut, and let P* be a set of representative 
pairs of P. Clearly, it is enough to find a set X of size k such that no pair in P* is reachable in D—X; 
we show that, in turn, the existence of such a set can be encoded into a gammoid on s and {J P*. 

Theorem 7 (*). There is a randomized polynomial-time compression algorithm for DIGRAPH 
Pair Cut which given an instance I = (D, s, P, k) and a positive real e computes a compressed 
representation of I of size (D(k 3 (k + log 1/e)). The success probability is at least 1 — e, and in the 
remaining cases the output encodes a positive instance. 

As a corollary, by standard arguments (cf. [5]), we get a polynomial coRP-kernelization for 
Digraph Pair Cut. A kernelization-preserving reduction from Almost 2-SAT to Digraph 
Pair Cut (see Section [6]) now finishes the kernelization of Almost 2-SAT, and the first half of 
Theorem [TJ Further reductions give polynomial kernels for, among others, Vertex Cover Above 
LP and RHorn-Backdoor Deletion Set; see Section El 

5 Finding irrelevant vertices: Polynomial kernels for cut problems 

In this section we extend our scope by showing how to use representative sets for the identification 
of irrelevant vertices in terminal cut problems. In this setting, a vertex is said to be irrelevant if 
there is at least one optimal solution which does not contain the vertex. Note that it is well possible 
that some irrelevant vertices are needed to build an optimal solution, but that any single one of 
them can be avoided. 

As a warm-up result, we will consider the Multiway Cut with deletable terminals(/c) 
problem, where the solution is allowed to contain terminals as well. For this simpler problem we 
are able to give a representative set characterization that covers not only all non-irrelevant vertices, 
but in fact contains an optimal solution. Thus a single iteration of the representative sets tool will 
produce a (randomized) polynomial kernel with 0(fc 3 ) vertices. 

Next, we move on to the main focus of the section, namely multiway cuts with a bounded number 
of terminals or partitions. We present a randomized polynomial kernel with 0(k s+1 ) vertices for 
the s-Multiway Cut(/c) problem, where the number of terminals is bounded by the constant s. 
For this problem we need a more "traditional" irrelevant vertex approach where only a single vertex 
is removed in each iteration. 

The approach has further consequences, in the form of the cut-covering sets of Theorems [3] and [4] 
- we find a small set of vertices which simultaneously contains an optimal solution for all min-cut 
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questions involving T (and a similar conclusion for multiway cuts). In particular, this result lets 
us replace the gammoid encoding of graph cuts of [25] by a "proper" kernelization rule. 

In Section [7J we give two further applications, namely polynomial kernels for s-Multicut(/c) 
and T-Feedback Vertex Set(/c) for groups V with at most s elements. 

5.1 Multiway Cut with deletable terminals 

In this section we focus on the Multiway Cut with deletable terminals (k) problem (DT- 
MWC(fc)), where the task is to separate a set T of terminals by deleting at most k vertices (including 
terminals). It can be easily seen to be equivalent to Multiway Cut restricted to terminals of degree 
one. The problem is NP-hard by a simple reduction from Vertex CoverU 

Let (G, T, k) be an instance of DT-MWC(fc), with G = (V, E) and T C V. We will use the tool 
of representative sets to identify a set of 0(k s ) representative vertices V* such that V*UT contains 
an optimal solution. But first, using a result of Guillemot [17], we limit \T\ in terms of k. 

Lemma 3 (*). Let (G,T,k) be an instance of Multiway Cut with deletable terminals(/c). 
An equivalent instance (G',T',k') with k! < k and \T'\ < 2k' can be computed in polynomial time. 

For the representative set, we form a graph G' by adding to every non-terminal vertex v two 
sink-only copies v' and v", and form the gammoid on G' with T as source set. We form the set S 
of triples {v,v' ,v"} for v G V\T, and let S* be a /^-representative set of S. Further, we let V* C V 
be the set of vertices v with {v,v',v"} G S*. We know that |V*| = C(/c 3 ); we argue (again) that 
there is an optimal solution X such that (X \T) C V* . 

Lemma 4 (*). Let (G,T,k) be an instance of Multiway Cut with deletable terminals (k), 
and let X be a minimum size multiway cut (of size k) which as a secondary criterion has a maximum 
size intersection with T . Let V* be a representative set of V , as constructed above. Then for 
every x G (X \ T), the set X + x' + x" is linked to T , and ICTU7*. 

Proof (sketch). The proof is given by matching arguments on an auxiliary bipartite graph H = 
(X' UT',E H ) where X' = X \ T and T' = T \ X (skipping deleted terminals t G T n X), and 
with an edge {x,t} if x can reach t in G — (X — x). There are two crucial facts. First, any 
pseudomatching M C Eh such that edges in M share no endpoint in T' corresponds to \M\ paths 
from T' to X' which overlap only on vertices in X'; this follows from the fact that each component 
of G — X contains at most one terminal. Second, it follows from the maximum overlap of X with T 
that any subset S C X has at least |»S| + 2 neighbors in H, since instead of S we could delete all 
but one terminal reachable from S. An application of Hall's Theorem completes the proof. □ 

By this lemma, we may set all vertices of V \ (V* U T) as undeletable, without changing the 
existence of a solution of size at most k. The easiest way to achieve this is by adding shortcut 
edges {u, v} between any two vertices u, v G V* U T which are connected by a path with internal 
vertices from V , and subsequently deleting V from the graph. Thus we get a polynomial kernel. 

Theorem 8. Multiway Cut with deletable terminals (A;) has a randomized kernel ofO(k 3 ) 
vertices. The error probability can be made exponentially small in n; all errors are false negatives. 

2 Given a graph G, create G 1 by attaching a terminal v' to each vertex v. Multiway cuts in G' correspond to vertex 
covers of G and vice versa. 
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5.2 Bounded terminals Multiway Cut 

We will now focus on s-Multiway CuT(fc), i.e., the variant of Multiway Cut(/c) where the 
number of terminals is bounded by some fixed constant s; we show a randomized polynomial kernel 
with 0(k s+1 ) vertices. 

Unlike for Multiway Cut with deletable terminals(£;) we are not able to directly form a 
set of representative vertices which is guaranteed to contain an optimal solution. Instead we show 
how to identify irrelevant vertices via representative sets techniques, and get the kernel by iterating 
computation of representative vertices and deletion of a single irrelevant vertex. 

We begin by identifying a condition under which at most 0{k s+1 ) vertices are representative, 
and such that for any single non-representative vertex v of G there is an optimal solution not 
containing v. Let T = {tx, . . . ,t s } be the set of terminals, arbitrarily ordered. Assume that 
the LP-based reductions of Guillemot [17] have been applied, so that the terminals have disjoint 
neighborhoods and |JV(T)| < 2k (see Lemma [3]). Our condition of representativeness, called high 
reachability, is defined as follows. 

Definition 2. Let (G,T, k) be an instance o/s-Multiway CuT(fe) with G = (V,E) and a set T C 
V of at most s terminals. Let X C V \ T be a multiway cut of at most k vertices. Furthermore, 
let G' be the directed graph obtained from G by adding one sink-only copy v' for each v G V \ T . 
We say that v £ X is highly reachable under X if X + v' + N(t) is independent for every t G T, 
in the gammoid (G',N(T)). We say that v G V is highly reachable if there exists some multiway 
cut X of at most k vertices such that v is highly reachable under X . 

Intuitively, after removing any one terminal from T, the solution X should still be independent 
with two vertex-disjoint paths to v. Though there are other ways to express this, the exact wording 
of Definition [2] will be useful for the group feedback vertex set problems in Section 17.21 

We may identify the highly reachable vertices in polynomial time. 

Lemma 5. We can identify a set of 0(k s+l ) vertices, in randomized polynomial time with error 
probability 0{2~ n ), which contains all highly reachable vertices. 

Proof. Create a gammoid which is the disjoint union of s + 1 layers 0,1, ... ,s, where layers 1, . . . , s 
are copies of the gammoid (G',N(T)), where G' is as in Definition (2) For layer we take the 
uniform matroid of rank k on base set V, i.e., we take (V, (<j.))- Then, create a set of tu- 
ples (v(0), v'(l), v'(2), . . . ,v'(s)) for every v G V(G), where v(0) is the copy of v in layer and 
each v'(i), with 1 < i < s is the sink-only copy of v in the i:th matroid layer. Use Lemma [T] to 
identify a O (k s+1 )-sized set of representative tuples (note that the rank can be bounded by 0(ks)). 

We show that the tuples (v(0), v'(l), v'(2), . . . , v'(s)) corresponding to highly reachable ver- 
tices v (with respect to some solution A") will be the unique choice for extending some particular 
independent sets. Hence they must be contained in the representative subset of the tuples. 

Let X be a solution to the instance, and let v G X be highly reachable under X. Consider the 
set X* which contains the following elements in the s + 1 layers: 

1. In layer it contains all vertices u£l \ {v}; these are at most k — 1. 

2. In each other layer i G {1, . . . , s} it contains the vertices X + N(ti). 

By assumption the tuple (v(0), v'(l), v'(2), . . . , v'(s)) extends this set (to an independent set). Note 
that v(0) is not among the vertices chosen for layer 0, since those are only the copies of X \ {v }. 
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Now assume that there is some vertex u 7^ v such that (u(0),u'(l), u'(2), . . . , u'(s)) extends X* . 
If u £ X, then its tuple intersects X \ {v} in layer 0. If u ^ X, then it is reachable from only 
one terminal, say ij, in G — X. Hence, in layer i, the vertex u'(i) cannot be added to X* since we 
already request paths which saturate (the layer i copies of) X and N(ti). 

Thus v is the unique vertex whose tuple can extend the independent set X* , implying that 
its tuple will be among the representative tuples computed via Lemma El Thus the correspond- 
ing 0(k s+1 ) vertices contain all highly reachable vertices. □ 

Now, we need to show that any single vertex which is not highly reachable can be removed 
without harm, i.e., that such a vertex is irrelevant. It suffices to show that any optimal solution X 
containing v can be converted into a solution of the same size and avoiding v. 

Lemma 6 (*). Let v be a vertex which is not highly reachable under any multiway cut of size 
at most k and is not contained in N(T). Then there is an optimal multiway cut which does not 
contain v. 

Proof (sketch). Let X be an optimal multiway cut for G with v £ X. Let t £ T such that X + 
v' + N(t) is not independent in (G',N(T)), according to Definition [2j It follows that there is a 
minimal cut C of size at most |X| + |iV(i)| separating N(T) and X + v' + N(t). The proof goes by 
constructing a new multiway cut X' from C and X, such that \X'\ < \X\ and v ^ X' . □ 

To conclude our result, we only need one simple reduction rule. 

Reduction Rule 1. Let {G = (V,E),T,k) be a multiway cut instance, and let v £ V \ (T U N(T)) 
be a vertex which is not identified as potentially highly reachable by Lemma\M Create G' from G 
by replacing N(v) by a clique and removing v from the graph. Return (G',T,k). 

Theorem 9 (*). s-Multiway CUT(fc) has a randomized kernel of 0(k s+1 ) vertices. The error 
probability can be made exponentially small in n, and errors are limited to false negatives. 

The kernel is, arguably, near- combinatorial, in the sense that it consists of a reduction rule 
which performs simple direct modifications to the input graph, while the condition of applicability 
for the rule may be seen as non-combinatorial, as it at the moment requires a representation of a 
gammoid and the multilinear algebra of Lemma [TJ 

5.3 Covering Graph Cuts 

In this section, we will adapt the above approach to prove the more general Theorems [3] and [4] about 
covering terminal min-cuts and minimum multiway cuts. In essence, we find that the representative 
sets condition of Lemma [5] gives us much more power than we need for kernelization. We begin 
with the statement for cuts in directed graphs. 

Proof of Theorem Let G, S, T be as given; we may assume w.l.o.g. that S are sources and T 
sinks in the graph (by adding source vertices before S and sink vertices after T; this does not 
modify any cuts). Let a vertex v £ V be essential if there are sets A C S, B C T such that every 
minimum [A, U)-vertex cut contains v, and irrelevant otherwise. Let ACS and B CT, and let Ca 
resp. Cb be the minimum (A, B)-veitex cut closest to A resp. to B. The idea of the proof is that 
vertices essential for an (A, B)-cut are exactly those in Ca H Cb, and, in turn, by Proposition [TJ 
we can use representative sets to find such vertices. 
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Claim 1 (*). Let r denote the size of a minimum (S,T)-vertex cut (which may intersect S and T). 
We can find a set Z C V of 0(\S\ ■ \T\ ■ r) vertices which includes all essential vertices. 

Now, any vertex in V \ [Z U 5 U T) may safely be made undeletable (as in Reduction Rule [T]) 
without changing the size of any minimum (A, B)-veitex cut. By induction, this gives a set of 0(15*1 • 
\T\ ■ r) vertices which covers a minimum (A, B)-vertex cut in G for every A C S, B C T, as 
requested. □ 

By a construction in [25] , we can transfer this result to preserving minimum cuts through a set 
of terminals, allowing for deleting terminals. 

Corollary 1 (*). Let G = (V,E) be a directed graph, and X C V a set of terminals. We 
can identify, in polynomial time, a set Z o/0(|X| 3 ) vertices such that for any S,T, R C X , a 
minimum (S,T) -vertex cut in G — R is contained in Z. 

Regarding tightness, it is easy to show that £1{\S\ ■ \T\) vertices are necessary, even in a very 
simple setup: Let S and T be disjoint sets of vertices of weight 2, and for every u G S,w £ T 
create a connecting vertex v UtW of weight 1, with N(v U:W ) = {u,w}. Then, the sets A = {u} 
and B = {w} show that we must include v U)W to preserve the unique minimum (A,B)-cut. This is 
easily converted into a setting without weights, by copying vertices in S and T into pairs of twins. 
We do not know whether the further factor of r in our upper bound is necessary. 

The above gives us direct polynomial kernels for a number of problems (as opposed to the 
implicit polynomial kernels resulting from polynomial compression due to NP-hardness reductions, 
e.g. as in [25]). 

Corollary 2 (*). Digraph Pair Cut has a kernel of 0(k 4 ) vertices; Almost 2-SAT has a 
kernel of 0(k e ) vertices. The signed graph generalizations of Odd Cycle Transversal and 
Edge Bipartization, where edges are marked as even or odd and the goal is to kill all odd-parity 
cycles, have kernels with 0(k 4 ' 5 ) respectively 0(k 3 ) vertices. All the above kernels are randomized, 
and can be derandomized with polynomial advice. 

Covering multiway cuts. We now turn to a multiway cut variant of the above. To state our 
results, we define a multiway cut of a partition as follows: Let X = (X%, . . . ,X S ) be a tuple of 
pairwise disjoint sets of vertices, X := IjXj. A multiway cut of X is a set of vertices C C V, 
which may intersect X, such that for i ^ j, there is no path between (JQ \ C) and (Xj \ C) 
in G—C. In other words, it is a multiway cut of the instance produced by adding terminals ti,...,t s , 
with N(ti) = X{. We show that we can find a set which covers a minimum multiway cut for every 
partition X = (X 1: . . . , X s ) with (Jj -X* C X. 

Proof of Theorem \4\ The proof will again be an irrelevant vertex construction, using the repre- 
sentative set condition of Lemma [5J as guide. Call a vertex v £ (V \ X) essential if there is a 
partition X = (X\, . . . ,X S ) of X' C X such that every minimum multiway cut of X contains v. As 
in Section T5.21 we find that such an essential vertex must be highly reachable under the partition X. 
Essentially, what remains to be proven is that Lemma [5] does not really depend on knowing the 
partition X in advance. 

Claim 2 (*). We can identify (in polynomial time) a set of 0(\X\ S+1 ) vertices which includes all 
essential vertices. 

The rest of the proof now proceeds as for Theorem [3j □ 
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6 Implications for Almost 2-SAT and related problems 



6.1 Almost 2-SAT Kernelization 

We will now give a randomized polynomial kernel for Almost 2-SAT, by reducing the iterative 
compression form of the problem to DIGRAPH PAIR Cut. By bootstrapping the compression step 
with an approximate solution of appropriate size, we get a polynomial kernel. 

Let us begin with explicitly defining the iterative compression form of Almost 2-SAT. For a 
2-CNF formula J 7 , let a deletion set X be a set of variables of J 7 such that removing every clause 
incident to X leaves a satisfiable instance. Note that the problem may alternatively be defined by 
deleting at most k clauses; these two forms are equivalent under simple transformations (cf. |29j). 

Almost 2-SAT Compression Parameter: \X\ 

Input: A 2-CNF formula J 7 , an integer k, and a deletion set X for J 7 . 
Question: Is there a deletion set X' for J 7 of size at most kl 

We now show the reduction. 

Lemma 7. There is a polynomial parameter transformation from Almost 2-SAT Compression, 
with input solution X and parameter k, to Digraph Pair Cut with parameter k' = \X\ + k. 

Proof. Let J 7 be a 2-CNF formula and X a deletion set for J 7 ; assume that \X\ > A; or else 
return a dummy yes-instance (this ensures a polynomially bounded parameter growth). Further, 
assume (by variable negation) that the remaining formula after X has been deleted is zero-valid, 
i.e., satisfied by the assignment which sets all variables to false. Let V be the set of variables 
of T. We define a Digraph Pair Cut instance (D,s,P,k) as follows. Let the vertex set of D 
be (V \ X) U {s} U {xo,xi : x £ X}. Initialize P to {{xq,xi} : x £ X} and create edges (s, Xi) 
for i = 0, 1 for every x £ X. Thus, variables x £ X are split into literals, and we require that at 
least one of the literals for every x £ X is removed, implying either an assignment to x or that x is 
deleted (where the latter requires two vertex deletions). By the assumption of zero- validity, we are 
now able to use directed edges and vertex pairs to enforce that any solution to the Digraph Pair 
Cut instance, removing \X\ + 1 vertices for t < k, corresponds to a deletion set of size t for J 7 . 

Concretely, transfer J 7 to the Digraph Pair Cut instance as follows. For every 2-clause in J 7 
contained in V \ X, transfer it unchanged: put implications as directed edges in D, and negative 
2-clauses as pairs in P. Note that positive 2-clauses do not occur here by being zero-valid. For 
clauses with one endpoint in X, create edges and pairs depending on the type of clause: assume 
in the following that x £ X and y ^ X. For a clause (x — > y), create a directed edge (x\,y); for 
a clause (y — > x), create a pair {xo,y}; for a clause (x V y), create a directed edge (xo,y)\ and 
for a clause (px V -*y), create a pair {x±,y}. Finally, for 2-clauses within X, create pairs, e.g., a 
clause (x — > x') becomes a pair {x\,x' }. Note that each edge incident on a variable Xi in D is 
either (s,Xi) or an outgoing edge (xi,y). We argue that the resulting Digraph Pair Cut instance 
has a solution of size k + \X\ if and only if J 7 has a deletion set of size k. 

On the one hand, let X' be a deletion set for J 7 , and let / be a satisfying assignment to the 
remaining formula. Create a set of vertices Z in D as follows: for every x £ X D X' , add xq and x\ 
to Z\ for x £ X \ X' ', with f{x) = i, add x\-i to Z; and for every y £ X' \ X, add y to Z. Now Z 
has the right size. Assume that Z is not a valid solution to the Digraph Pair Cut instance, i.e., 
that some pair p £ P is reachable in D — Z. First, we find that every vertex in V \ X which is 
reachable in D — Z must be true in /, as otherwise / would falsify a clause. Now consider the 
pair p. In every case, there is a corresponding clause in J 7 which is falsified by /. 
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On the other hand, let Z be a solution to the Digraph Pair Cut instance, and let X* contain 
those variables x € X for which both literals are deleted. Create an assignment / to variables 
corresponding to non-deleted vertices, by letting f(x) = i if Xi is non-deleted, and for y G V \ X, 
let f(y) = 1 if y is reachable from s in D — Z, and f{y) = otherwise. Let the deletion set 
be X' = [Z n (V \ X)) U X* . Now / assigns a unique value to every variable not in X'; consider 
a 2-clause in T falsified by /. Every such clause corresponds to a directed edge or a pair, in a way 
which would contradict Z being a solution to the Digraph Pair Cut instance. 

Thus we find that the reduction is correct. □ 

By this reduction and Theorem[7J we get a randomized polynomial-time compression of Almost 
2-SAT Compression to size <5(|X| 4 ). To finalize the Almost 2-SAT kernel, we only need an 
initial deletion set X of size poly(fc) to bootstrap the kernelization from. By Agarwal et al. [I], 
Almost 2-SAT has an O{yf\ogn) -approximation (recall that the clause- and variable-deletion forms 
are equivalent). Using the same tricks as for Odd Cycle Transversal [25] . we may combine 
this with an FPT algorithm to get a solution of size 0(k 1 ' 5 ). We may then get a polynomial kernel 
in the form of an Almost 2-SAT instance via an NP-completeness back-reduction. 

Theorem 10. Almost 2-SAT has a randomized polynomial-time compression to size 0(k 6 ), imply- 
ing a polynomial coRP -kernelization. 

6.2 Implications of the Kernel 

In this section, we show some consequences of the Almost 2-SAT kernel. As Almost 2-SAT is 
quite a general problem, several interesting problems have been shown to reduce to it via parameter- 
preserving reductions. Many of these reductions are surveyed in [38]. We get the following corollary. 

Corollary 3. The following problems have randomized polynomial kernels: Vertex Cover Above 
Matching, Vertex Cover parameterized by Konig Deletion Set, Konig Vertex Dele- 
tion restricted to input graphs having perfect matchings, and RHorn-Backdoor Deletion Set. 

Proof. All listed problems are NP-hard even with parameter value encoded in unary (since the 
maximum cost of any solution is already polynomially bounded). It suffices to refer to the ap- 
propriate polynomial parameter transformations (cf. [5]): Vertex Cover Above Matching 
and Almost 2-SAT(fe) are equivalent under PPTs |38j . The following problems are reducible to 
Vertex Cover Above Matching under PPTs: Konig Vertex Deletion restricted to input 
graphs having perfect matchings [31] and RHorn-Backdoor Deletion Set [16] . 

For Vertex Cover parameterized by Konig Deletion Set, i.e., given X such that G — X 
is a Konig graph, parameterized by \X\, one may observe the following: If the target value, say £, 
is larger than a minimum vertex cover of G — X (equal its maximum matching size, say m(G — X)) 
plus \X\ then the answer is trivially yes. Otherwise, from £ < \X\ + m(G — X) it follows that £ — 
m(G) < £ — m(G — X) < \X\. Hence, by accounting for the target value as k := £ — m(G), i.e., 
"above maximum matching", the parameter can only decrease and we have an immediate PPT. □ 

The result for Vertex Cover parameterized by Konig Deletion Set generalizes the 
best known kernelization result for Vertex Cover parameterized by a Feedback Vertex 
Set |22j. There the input contains X such that G — X is a forest, parameterized by \X\; as forests 
are also Konig, the parameter can only decrease by reducing to parameterization by Konig deletion 
set. The same is true in comparison to parameterization by an odd cycle transversal X. 
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We also provide a new reduction result in the form of a PPT from Vertex Cover Above LP to 
Vertex Cover Above Maximum Matching. This implies that the two problems are equivalent; 
the reverse direction holds trivially since the parameter can only decrease when comparing the target 
value to the LP cost instead of to a maximum matching. 

Lemma 8. There is a polynomial parameter transformation from Vertex Cover Above LP to 
Vertex Cover Above Maximum Matching. 

Proof. Let (G, k) be an instance of Vertex Cover above LP, i.e., asking whether G has a vertex 
cover of size at most LP(G) + k, where LP(G) denotes the minimum cost of a fractional vertex 
cover for G. Let i = LP(G) + k. It is well known that the linear programming relaxation for 
Vertex Cover is half-integral, so we will only consider such solutions. 

It suffices for us to show that G contains a large matching. For the argument let us fix any 
minimum integral vertex cover X of G, and assume that the instance is yes, so \X\ < £. Create a 
graph G' by adding a set of 2k + 1 vertices Y, and making X and Y fully adjacent. Let us consider 
optimal fractional vertex covers of G' . We may clearly assume that each vertex of Y is assigned 
the same value in such a cover. We also note that X is a vertex cover of G' , hence LP(G) < \X\. 

If all vertices of Y are assigned |, or all are assigned 1, then the total cost is at least 

LP(G) + ||Y| > LP(G) + k>£> \X\, 

using the simple fact that G' always incurs a cost of at least LP(G) on the induced subgraph G 
in G' . Thus, optimal fractional covers of G' will assign zero to all of Y, since otherwise a cost 
exceeding X is incurred. Setting Y to zero already forces setting X to one, giving the unique 
optimal fractional solution (with all further vertices also set to zero). 

Thus the graph G' has a vertex cover X of size matching the minimum fractional cost LP(G'). 
It is easy to see that it also has a matching of size \X\ = LP(G'): Assume that the vertices of X 
cannot be matched into V{G') \ X. It follows, by Hall's Theorem, that there is a subset X' of X 
with \Nqi(X')\ < \X'\. But then assigning | to X' and N(X') would give a cheaper fractional 
solution, a contradiction. 

It follows that G' has a matching of size \X\, implying that G = G' — Y has a matching M of 
size at least \X\ — \Y\ > LP(G) — (2k + 1). Thus we get that our target value £ exceeds the size of 
a maximum matching by at most I - \M\ < I- (LP(G) - (2k + 1)) = I- LP(G) + 2k + 1 < 3k + 1. 

Note that we do not need to know whether (G, k) is yes, nor do we need to know X. We may 
compute a maximum matching M of G and check whether it meets at least the bound which we 
proved for the yes-case. If so, then (G, k') with k' = I — \M\ = LP(G) + k — \M\ is an equivalent 
instance of Vertex Cover Above Maximum Matching with k' < 3k + 1. Otherwise, we know 
that (G,k,£) is no, and we return a dummy no-instance. □ 

As an immediate corollary we get that Vertex Cover Above LP admits a polynomial kernel. 
Corollary 4. Vertex Cover Above LP admits a randomized polynomial kernelization. 

7 Implications for mult i- terminal cut problems 
7.1 Bounded terminal pairs multicut 

Now, we can extend the result for s-Multiway Cut(/c) to also give a polynomial kernel for s- 
Multicut(/e), i.e., Multicut(&) restricted to instances having at most s terminal pairs. The key 
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fact is that an optimal multicut X for some instance (G, T, k), where T C („) of size at most s, is 
also a multiwaycut for some partition of the terminal vertices V(T) = {v \ 3u : {u, v} € T} in which 
no set contains both vertices of some terminal pair. By considering partitions that are maximally 
coarse, i.e., merging any two sets would violate this condition, we get a relation to multiway cut 
instances with up to (roughly) y/2s terminals. 

Theorem 11. s-Multicut(A;) admits a randomized polynomial kernel with f(s)k^^^^ +1 vertices. 
The error probability can be made exponentially small in n, and errors are limited to false negatives. 

Proof. Let (G, T, k) be an instance of s-MuLTiCUT(fc) with T = {{ai,b±}, . . . , {a r , 6 r }}, for r < s. 
Consider an optimal multicut X of size at most k. Clearly, in G — X there is no path from aj to hi 
for any i. Let T = T\ U . . . U T p be any partition of {a\, . . . , a r , b\, . . . , b r } into p nonempty sets, 
such that there is no path from u to v in G — X for any u G Tj and v G T~ with i ^ j. Hence X 
is a multiway cut of T = (T\, . . . ,T P ). Observe also that merging any two sets Tj and Tj gives a 
coarser partition, for which X is still a multiway cut. 

We will now make T maximally coarse, under the restriction that no two terminals and b{ 
are contained in the same set of the partition. This is easily achieved by iteratively merging any 
two sets of T which do not (together) contain both and hi for any i. We obtain a partition T* = 
(T[, . . . ,Tp*) such that no two sets can be merged since they together contain a terminal pair 
(and recall that X is a multiway cut for T*). It follows that p* < \\^2s~\, since there are r < s 
terminal pairs and each pair T/, Tj is prevented from merging by a different terminal pair (so 

if p* > \/2s + 1, there would have to exist ( v/ ^ +1 ) > s terminal pairs). Thus, X is a multiway cut 
of {a±, . . . , a r , b\, . . . , b r } into at most [\/2s] groups. 

To complete the proof, we can apply (i) the approach used for s-Multiway CUT(fc) or (ii) 
Theorem |4] for covering multiway cuts. We will briefly sketch both options. 

(i) To use the s-Multiway Cut (A;) approach we need to change the way of identifying an 
irrelevant vertex. Essentially, for each partition T of {01, . . . , a r , b±, . . . , b r } into p < \V2s~] sets, 
none of them containing a terminal pair, we have to preserve the possibility of having a multiway cut 
of size at most k. Thus, for each partition, identify each set into a single terminal, and generate a set 
of representative vertices (covering the highly reachable vertices). However, only those vertices are 
declared irrelevant which are not in this set for any of the partitions. Since there are at most [\/2s] 2s 
such partitions, there are at most 0{ \\f2s\ 2s k}^^ +1 ) vertices which are contained in at least one 
representative set. Hence, by iterating this procedure, removing one irrelevant vertex each time 
(and making its neighbors a clique), we reduce to that number of vertices. 

(ii) To apply Theorem [J] we need to make the vertices in {a\, . . . ,a r ,b±, . . . ,b r } effectively 
undeletable. To this end, we simply create a total of k + 1 copies of each vertex and connect 
them into a clique. We apply the theorem to the set {ai(l), . . . , a±(k + 1), 02(1)) • • • , b r (k + 1)} (of 
size 2r(k + 1) < 2s(k + 1)) for partitions into up to |~\/2s] sets. We get a set Z of size 0((2s(k + 
i^Tv^l+i) containing a multiway cut for each such partition. 

For both options, errors can only be introduced by making too many vertices effectively un- 
deletable (for (ii) this happens when Z does not contain an optimal solution), which can result in 
false negatives but no false positives. □ 
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7.2 Group feedback vertex set for fixed groups 

As a further consequence we show how to obtain a polynomial kernel for the following group 
cut problem. Let (r, (8>) be a finite group with unit element ±r E I\ and let T = {ai, . . . ,a s } 
where a± = lr- 

T-Feedback Vertex Set(/c) Parameter: k. 

Input: A directed graph D = (V, A), an edge labeling eft : A — > T, and an integer k. 
Question: Is there a set X of at most k vertices such that D — X has a consistent labeling, i.e., a 
function ir : V — > T such that ir(u) <g) 4>((u, v)) = tt(v) for all arcs (u, v) € A \ F. 

Equivalently to the above, one may ask for a set of at most k vertices which meets all nonnull 
cycles, i.e., cycles in the underlying undirected graph whose edge labels sum to a ^ lr; the label 
of an edge which is used in reverse direction is thereby interpreted as the inverse. This and similar 
problems were, e.g., studied by Guillemot |17] and Chudnovsky et al. [6j. Note that Odd Cycle 
Transversal is an instance of this problem, with the group Z%. Guillemot [T7] shows that Group 
Feedback Arc Set(/c) and Group Feedback Vertex Set(/c + |T|) (see below) are FPT, but 
leaves open the questions of polynomial kernels, and whether Group Feedback Vertex Set(A:) 
is FPT. Our current results make partial progress towards the kernelization questions, but note 
that there is an easy reduction from Multiway Cut(/c) to Group Feedback Vertex Set(&): 
Use a group of size at least matching the number of terminals. Make a clique on the terminals and 
add correct edge labels to enforce a different value for each terminal. (Also make the terminals 
"heavy" by making budget+1 copies.) All other edges have the unit label. It is easy to verify that 
group cuts correspond directly to multiway cuts. 

Other variants of the problem include Feedback Arc Set variants, where we ask to remove a set 
of k edges instead of vertices, and Group Feedback Vertex Set, where the group T is a part 
of the input (or even given via oracle access). Note that the edge deletion variant reduces to the 
vertex deletion variant through standard observations. 

We remark that the graph is directed only as a technical requirement, i.e., to define a direction 
for the element labels. In all practical aspects (when referring to cycles in D, etc), we will treat 
the graph as an undirected graph (the underlying undirected graph of D), with edge labels that 
read differently in different directions. Thus, if (u, v ) E D with label a, then we act as if in 
addition (v,u) E D with label a -1 . Since we are dealing with the vertex deletion variant, this does 
not cause any problems. 

As a first step towards our kernel we will require an efficient procedure for producing an initial 
solution of size k ^ 1 ' . This we do by reducing our problem to a Vertex Multicut setting, by 
splitting every vertex in D into s copies, in a way that can be seen as splitting vertices into "literals" . 
Known approximation results for Vertex Multicut then provide us with the initial solution we 
need. 

Lemma 9. Let (D, <f>, k) be an instance o/T-Feedback Vertex Set(&), for a fixed group T with s 
elements. There is a polynomial-time procedure that either proves that the instance is negative or 
produces a solution of size 0(s 2 k 2 ). 

Proof. Let D = (V, A). We construct a graph G = (V , E) with V = {v(a) : v E V, a E T} and E = 
{{u(a),v(a<8>(j>((u, v)))} : (u, v) E A, a E T}. These edges are chosen such that if {u(a), v(a')} E E, 
and if tt is a consistent labeling which labels both u and v, then ir(u) = a if and only if ir(v) = a' . 



16 



Finally, the set of terminal pairs is T = {(u(a) , u(a' )) : u £ V, a, a' £ T, a ^ a'}. We show that the 
optimal multicut of (G, T) is within a factor of s of the optimal feedback vertex set of {D, (ft). 

Claim 3. Let X be a feedback vertex set for (D, (ft) of size k. Then X' = {v(a) : v £ X,a £ T} is 
valid multicut of (G, T) of size sk. 

Proof. Any path between terminal pairs in G — X' forms a non-null cycle in D — X. □ 

Claim 4. Let X be a multicut of (G,T). Let V(X) = \J v ( a ) e xi v } ■ Then V(X) is a solution to 
the group cut problem of size at most \X\. 

Proof. Consider a non-null cycle in D — V(X), and a member v of this cycle. Arbitrarily select a 
literal v(a) of v, and let S be the set of literals propagated along the cycle, i.e., if u is neighbor 
to v in the cycle with an edge that proscribes tt(u) = ir(v) ® a', then add the literal u(a (8> a') to S. 
Do this for one revolution around the cycle, so that S contains two literals of v. The literals of 5 
form a path in G — X between different literals of v, which contradicts X being a multicut. □ 

Now, if {D, (ft, k) is positive, then an algorithm of Gupta [19] can be used to produce a multicut X 
for (G, T) of size 0(s 2 k 2 ), which can then be projected back into an initial feedback vertex set V(X) 
for (D, (ft) of the same size. Otherwise, if X is too big, we may reject {D, 4>,k). □ 

For the remainder of the section let us fix one instance (D, (ft, k) of T-Feedback Vertex Set(&) 
and let X be an approximate solution of size at most 0(s 2 k 2 ) obtained via Lemma [9l A simple 
normalization is possible, in the form of an untangling: Fix any consistent labeling tt: (V\X) — > T 
for D — X. We would like to get an equivalent instance but in such a way that assigning lp to all 
vertices is feasible for D — X. To this end, intuitively, we add to each vertex v the inverse of tt(v) 
and make the correct changes for all arcs incident on v: For arcs (it, v) we add the inverse of ir(v) 
to 4>((u,v)) (thereby defining a new arc labeling </>'). For arcs (v,u) we add n(v) to <f>((v,u)). Note 
that this is performed taking all of D into consideration, not only D — X. It can be easily verified 
that doing this vertex by vertex does not change the existence of a consistent assignment of D — Y 
for any set of vertices Y. In the end, all arcs in D — X will be labeled with lp and assigning lp 
(or any other single group element) to all vertices is a consistent labeling of D — X. The labels 
of arcs incident to X are arbitrary. For ease of notation, we will assume that this operation has 
already been performed on [D, (ft, k) (as a nice side effect the orientation of arcs in D — X becomes 
immaterial as lp is self- inverse) . We also assume that arcs with exactly one endpoint in X are 
outgoing from X. 

Algorithmically, this implies that we may solve the problem by a set of calls to Multiway Cut 
with s terminals: Consider each of the (s + ly x \ possible decisions to either delete x S X or fix 
an assignment ir(x). Then, for every neighbor v of a non-deleted vertex x £ X, this implies an 
assignment to v. Thus, create a graph with s terminals ct\, . . . , a s , by starting from the underlying 
undirected graph of D — X, adding said terminals, and adding an edge (a, v) for every vertex v 
such that some assignment tt(x) implies ir(v) = a. It is not hard to see that asking for multiway 
cuts over the resulting graphs produces a feedback vertex set for (D, (ft, k). To incorporate this into 
a kernel, we need some modifications to incorporate the vertices of X, but the idea is the same. 

Definition 3. Let (D, (ft, k) be a T-Feedback Vertex Set(/c) instance, normalized with respect 
to a feedback vertex set X. The graph G(D, (ft, X) is an undirected graph with vertex set V = 
(V \ X) U {ai,...,Q! s } U {x(a,a') : x £ X,a,ct £ T}, such that G[V \ X] is the underlying 
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undirected graph of D[V\ X], and with additional edges as follows. For every x G X and a, a' G V, 
connect x(a,a') to terminal a® a'. Furthermore, for every arc (x,v) in D with v ^ X, and 
every a £ T, add an edge {x(a,(p(x,v)),v}. The terminals ofG(D,(p,x) are {a\, . . . ,a s }. 

Let G = G(D, (p, X) and let T be the set of terminals. Now feedback vertex sets in (D, </>) 
correspond to certain multiway cuts in (G, T), in a way that we define next. 

Definition 4. Let (G, T) be as above. A multiway cut Z of (G, T) is consistent if the following 
hold: For every x G X, there is at most one a G T such that some vertex x(a,a') in V does 
not occur in Z, and furthermore, letting ir(x) = a for every x G X such that x(a, a') ^ Z for 
some a' G T induces a consistent partial assignment on D[X]. A vertex x G X is deleted by Z if 
for every a G T some vertex x(a,a') occurs in Z. The cost of Z equals \Z n V\ plus the number 
of x £ X that are deleted by Z. 

Note that we slightly relax the consistency condition by not requiring that every literal of a 
deleted vertex x G X occurs in Z, as this will be convenient for the proof. The correspondence 
between multiway cuts in (G,T) and feedback vertex sets in (D,<p) is as follows. 

Lemma 10. (D,(f>) has a feedback vertex set of size at most k if and only if(G,T) has a consistent 
multiway cut of cost at most k, where cost and consistency are in the sense of Definition 

Proof. On the one hand, let Y be a feedback vertex set, and let tt be a consistent assignment 
for D — Y. Let Z = (Y \ X) U {x(a,a') G V : x G Y} U {x(a,a') G V : ir(x) / a,x $ Y}. 
Clearly, Z is consistent (if it is a multiway cut) and has cost \Y\. We show that Z is a multiway 
cut. In fact, we argue that for every u £ V \ X such that u is of distance two from a terminal a 
in G — Z, it must hold that ir(u) = a, and that for every pair of neighbors u,v G V \ X in G — Z we 
must have tt(u) = ir(v). The latter holds, since all edges of D — X have label lr- For the former, 
observe that the intermediate vertex must be x(ir(x), tt~ 1 (x) a) for some x G X \ Y . But then, 
there is an edge (x,u) in D with (j>{x,y) = 7r _1 (x) ® a, implying by consistency that tt(u) = a. 
Thus, consistency forbids that any vertex in V \ X is reached from more than one terminal. It is 
clear that no other paths between terminals are possible. 

On the other hand, consider a consistent multiway cut Z, and let Y consist of those x G X 
deleted by Z as well as Z D (V \ X). It is clear that \Y\ equals the cost of Z\ we will show 
that D — Y admits a consistent labeling. For this, we invert the previous observation: let ir(u) = a 
for all u G V \ X which are reachable from terminal a in G — Z, and tt(x) = a for every x £ X such 
that vertices x(a,a') are spared by Z. Finally, let ir(u) = lp if u is reachable from no terminal 
in G — Z. We argue that this is a consistent labeling. The labeling is consistent over internal 
edges in D — X — Y, and for non-reachable vertices u, since all corresponding arc labels are lr- 
Further, it is consistent over edges within D[X \ Y] by consistency of Z. Finally, it is consistent 
over edges (x, u) by design, by the same argument as before. □ 

Thus, we need to preserve all minimal consistent multiway cuts in (G,T). We show that the 
potentially highly reachable vertices of (G, T) identified by Lemma [5] still allow for an irrelevant 
vertex reduction rule with respect to consistent multiway cuts. 

Lemma 11. Let V* be the set of potentially highly reachable vertices identified by Lemma\^ applied 
to (G,T). Let v G (V \ X) \ V* . Then v is an irrelevant vertex, i.e., there exists an optimal 
consistent multiway cut Z with v £ Z . 
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Proof. Let Z with v € Z (V \ X) be a consistent multiway cut with minimum cost, in the sense 
of Definition HI Secondarily, boost Z to maximum cardinality, i.e., for every x £ X the number of 
vertices x(a, a') in Z is either s 2 or s(s-l). Now, let Z v := Zn(V\X) and Z T := ZnN(T) = Z\Z V . 
Since vertices x(a,a') G Z are contained in Zt Q N(T), they are effectively treated differently by 
the condition of high reachability. We show that Zy may be exchanged for a different cut Zy, such 
that Zt U Zy is a consistent multiway cut of at most the same cost, with v ^ Z' v . 

Let a be such that Z + v' + N(a) is not independent in (G, N(T)). We first argue that Z U 
N(a) is independent: assume not, and let C be an (N(T),Z U iV(a))-min cut closest to N(T). 
Let Z' T = Z T U iV(a); occurs on both sides of the cut, clearly Z' T C C. Thus C \ ^ is 

a min-cut (N(T) \ Z' T ,Zy), of size smaller than |Zy|. Since the cost of Zy is just its cardinality, 
replacing Zy by C \ Z' T in Z yields a multiway cut of smaller cost, which furthermore will be 
consistent. Thus Z L) N(a) is independent. 

Consider now an {N(T), ZU N(a) U {V})-min cut C closest to N(T), and let Cy = Cn(V\X). 
We argue that v ^ Cy and that Cy is a valid replacement for Zy in Z, i.e., that Cy can fill the 
role of the above promised Zy. As before, ZtU N(a) C C, and by independence |C| = |ZUiV(a)|, 
so |Cy| < \Zy\. Since C cuts v' from N(T) \ Z' T and by the minimality of C and independence 
of ZU N(a), we thus have u ^ Cy. As above, it is now clear that replacing Zy by Cy in Z gives a 
consistent multiway cut not containing v, of no larger cost than Z. □ 

Since the rank of the gammoid is r := s 2 |X| 2 , we get a polynomial kernel with C(r s+1 ) = 
0(f(s)k 2s+2 ) vertices. Note that we may also get a kernel directly in the language of T-Feedback 
Vertex Set (A;), since an irrelevant vertex may be made undeletable in the original setting as well. 

Theorem 12. Let T be a fixed group with s elements. T-Feedback Vertex Set(/c) admits a ran- 
domized polynomial kernel with 0(k 2s+2 ) vertices. The error probability can be made exponentially 
small in n, and errors are limited to false negatives. 

8 Omitted proofs 

8.1 Omitted proofs of Section [3] 

Proposition [TJ. Let D = (V,A) be a digraph with a set of source vertices S C V , and let icy. 
Let D' be the result of adding a sink-only copy x' for every vertex x G X. The following hold. 

1. The set X is closest to S in D if and only if X + x' is independent in the gammoid (D',S) 
for every x G X \ S. 

2. Let Xb be a maximal independent subset of X. A vertex v is reachable from S in D — C(X) 
if and only if Xb + v is independent in the gammoid (D 1 , S). 

Proof. For the first part, assume first that the condition holds. Let Z be a vertex min-cut (S, X) 
distinct from X, and let x G X \ Z. By assumption we have \Z\ = \X\ and x ^ S. However, 
since X + x' is independent in the gammoid, we have \X\ + 1 vertex-disjoint paths from S to X 
in D, of which Z will miss at least one. Thus Z cannot exist. 

On the other hand, let x G X \ S, and assume that X + x' is dependent in the gammoid. By the 
cut/flow duality, since the flow from S to X + x' in D' is \X\, there is an (S,X + x')-cut Z in D' 
of size \X\. We claim that x ^ Z: since X is independent, there are vertex-disjoint paths from S 
to X, which are hit exactly once each by Z; thus x' ^ Z. If x G Z, then by minimality of Z there 
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is a path from S to x avoiding Z — x. However, this path can also be used as a path from S to x' 
avoiding Z, contradicting that Z is a cut. Thus Z is a cut (S, X) of size \X\, which is closer to S 
than X is. Clearly, Z is also a cut in L>. 

For the second part, first assume that Xb + v is independent. The cut C(X) has size |^Cb|, 
and separates <S from X. Thus it will not intersect the path from S to v. On the other hand, 
assume that Xb + v is dependent, and consider an (S,X + v)-cut Z of size \Xg\. This also serves 
as an (S, X)-min-cut, which separates v from 5. Thus C(X) separates v from S\ Note that 
using (D, S) or (D' , 5) makes no difference here. □ 

8.2 Omitted proofs of Section [4] 

Theorem [6], T/ie Digraph Pair Cut problem can be solved in time 0*(2 k ). 

Proof. The algorithm uses the simple observation that any solution to Digraph Pair Cut can be 
transformed to a solution of at most the same size which performs a cut closest to S. 
Let a digraph D = (V, A), a vertex s S V, a set of pairs P, and an integer k be given. 

1. Initialize by creating copies S = {s±, . . . , Sk+i} of s (to make s undeletable), and let T = 0. 

2. Let C be the min-(S', T)-cut closest to S. If \C\ > k, reject the instance. If no pair is 
reachable in D — C, accept the instance. 

3. Otherwise, let p = {u, v} be a pair reachable in D — C. Branch two ways, adding either u 
or v to T and return to Step El 

Since every recursion in Step [3] increases the min-cut size A, at most k recursion steps are taken, 
and the algorithm runs in time 0*(2 k ) as promised. □ 

Lemma [2l Let D = (V, A) be a digraph, s S V, k an integer, and P C (^) a set of vertex pairs. 
In randomized polynomial time (with failure probability exponentially small in the input size) we 
can find a set of 0{k 2 ) pairs P* C P (the representative pairs,), such that for any set X C V \ {s} 
closest to s of at most k vertices, the graph D — X contains a reachable pair p £ P if and only if 
it contains a reachable pair p* £ P* . 

Proof. Let D, s, k, and P be as given. Split s into k + 1 copies; let S contain these vertices. Form 
a matroid M by taking two disjoint copies of the gammoid (D, S); let v(l) and v(2) be the copies 
of a vertex v E V in the first resp. the second part of this matroid. Now, form a set II by taking, 
for every {u,v} € P, the pair {u(l),v(2)}. We show that the set P* can be formed exactly by 
taking a 2/c-representative subset of II. Since the rank of M is 2k + 2, the size of P* follows from 
Lemma [TJ Observe that we may always assume that X is a closest set (to S), since the reachable 
pairs in D — C(X) are a subset of those reachable in D — X. 

Thus, let X be a closest set to S, and let X' = {x(l),x(2) : x G X} contain all copies of the 
vertices of X. We show that for any pair p = {u, v} G P, the pair {u(l),v(2)} extends X' if and 
only if p is reachable in D — X. For this, first observe that {u(l),v(2)} extends X' if and only if 
both u and v extend X in the gammoid (D, S), by construction. This happens if and only if p is 
reachable in D — X: Note that X is independent in (D, S), and C(X) = X. By Prop. [TJ we thus 
have that u is reachable in D — X if and only if X + u is independent in (D, S), and ditto for v. 

Thus, let II* be a 2/c-representative subset of n, and let P* contain those pairs occurring 
in II*. Assume that D — X contains a reachable pair p = {u,v} £ P. By the above, we have 
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that {u(l),v(2)} G II extends X' , and since \X'\ < 2k, some pair in II* extends X' as well. This 
represents some reachable pair p* G P* , which finishes the claim. 

As for running time and randomness behavior, since M is itself a gammoid, we may simply 
apply Theorem [5] and Lemma [TJ □ 

Theorem 0. There is a randomized polynomial-time compression algorithm for Digraph Pair 
Cut which given an instance I = (D, s, P, k) and a positive real e computes a compressed represen- 
tation of I of size 0(k 3 (k + log 1/e)). The success probability is at least 1 — e, and in the remaining 
cases the output encodes a positive instance. 

Proof. We begin by stating explicitly the dependency on the encoding gammoid. 

Claim 5. Let (D,s,P,k) be an instance of Digraph Pair Cut, and let P* C P be the repre- 
sentative pairs given by Lemma\M when applied to D, s, k, and P. Then {D,s,P,k) is positive if 
and only if there exists a set T C [j P* intersecting every p G P* such that the size of an (s, T)-cut 
disjoint from s is at most k. 

Proof. In one direction, assume that the size of an (s, T) cut is at most k for some T C [J P* . Let C 
be an (s, T)-cut that is closest to s. We know that no pair p* G P* is reachable from s in D — C, 
implying, by Lemma El that no pair p G P is reachable from s in D — C. Thus, C is a solution for 
the instance (D, s, P, k). 

On the other hand, every solution to the instance must in particular separate s from at least 
one member of every pair p G P* C P. Thus if no considered set T produces an (s, T)-cut of size 
at most k, then the instance is negative. □ 

By this claim, we only need to represent the sizes of cuts (s, T) distinct from s, with T C [JP*. 
Furthermore, we only need to represent whether the size of such a cut is bounded by k or not. 
Thus, create k + 1 parallel copies Sj of s, and create a gammoid with source set S = {si, . . . , s^+i} 
and sink set T = (J P* . By Lemma [2] and |25[ Corollary 1] , a representation of this gammoid with 
size 0(k 3 (k + log 1/e)) can be found in randomized polynomial time, with the requested success 
probability. □ 



8.3 Omitted proofs of Section 15.11 

Lemma [3]. Let (G,T,k) be an instance of Multiway Cut with deletable terminals(/c). An 
equivalent instance (G',T', k') with k! < k and \T'\ < 2k' can be computed in polynomial time. 

Proof. Create an instance (G, T, k) of Multiway CUT(fc), with undeletable terminals, by attaching 
an undeletable super-terminal t' to every terminal t G T, where t' is only neighbor to t. Let G be 
the resulting graph and T the set of new terminals. Then the solution sets of (G, T, k), interpreted 
with deletable terminals, and (G,T,k), interpreted with undeletable terminals, are identical. We 
may now use the LP-based approach of Guillemot [T7j, as also used in [8], to reduce the instance 
to at most 2k terminals. Let X be the support of a half-integral optimum of the LP relaxation 
of (G, T), as in [171 Sect. 2.2]. Let U C V be the vertices reachable from T in G — X. Then by |17[ 
Prop. 1], there is an optimal solution to (G,T), and thus to (G,T), disjoint from U. Now, contract 
the vertices of U \ T into T in G (it is clear that this is uniquely defined, as no vertex in U is 
reachable from two terminals in G — X). Note that every terminal reaches X via vertices only 
from U. We may now apply the reduction rules of [8], and delete members of X with at least two 
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neighbors in T (decreasing k accordingly), until what remains is at most \X\ < 2k vertices, with 
one neighbor in T each. Terminals that become isolated are naturally dropped. □ 

LemmalU Let (G,T,k) be an instance of Multiway Cut with deletable terminals(A;), and 
let X be a minimum size multiway cut ( of size k ) which as a secondary criterion has a maximum 
size intersection with T. Let V* be a representative set of V , as constructed above. Then for 
every x £ (X \ T) , the set X + x' + x" is linked to T, and X CTUV*. 

Proof We begin by showing that X + x' + x" is linked to T. Let T' = T \ X and X' = X \ T, 
and create a bipartite graph H = (X' U T', Eh) by connecting x £ X' to t £ T' if and only if x is 
reachable from t in G — (X — x). Assume that \X'\ > 0, since otherwise the claim is vacuous. Thus 
we also have \T'\ > 2 (since otherwise \X n T\ is not maximal - note that T \ {t} is a valid solution 
for any t £T). We make the following easy claim. 

Claim 6. Let M C Eh be a pseudomatching in H, i.e, every t £ T' intersects at most one edge 
in M . Then there is a corresponding set of vertex- disjoint paths from distinct terminals in T 1 to 
the endpoints of M in X' . 

Proof. Edges in H correspond to paths in G which intersect X only in their end points. Since 
every vertex in G — X is reachable from at most one terminal, we may arbitrarily take one path 
in G — X + x for every edge in M. □ 

Thus, we need to show that for every x £ X' there is a pseudomatching which saturates X' and 
intersects x three times. For this, we need another claim. 

Claim 7. For every S C X' we have \N(S)\ > \S\ + 1 in H. 

Proof. Assume that |iV(5)| < IS"! + 1 for some SCX', Let S' = N(S) — t for some arbitrary t E 
N(S). We claim that replacing S by S' in X yields a valid solution, of size at most \X\. Clearly, 
no terminal outside of N(S) is affected by removing S from the solution (since all paths from such 
terminals must go via other members of X, which remain in the solution). Thus, the only failure 
would be a path between members of N(S). However, only one member of N(S) remains undeleted. 
Thus we create a new solution, which has size at most |X| and intersects more terminals than X, 
contradicting our assumption on X. □ 

Now, the conclusion follows simply by Hall's condition. Create three copies x,x',x" for the 
arbitrary vertex x £ X' , and ask for a matching saturating X' + x' + x" . The non-existence of such 
a matching would imply a set S C X' such that |iV(5)| < \S + x' + x'\ = \S\ + 2, which by the 
previous claim contradicts our choice of X. 

Thus, there are vertex-disjoint paths from T' to X' + x' + x" , and similarly (adding trivial paths 
for X DT) from T to X + x' + x" , for any arbitrary x £ X' . 

For the final part of the conclusion, we note that {x,x',x"} uniquely extends X — x: For 
vertices y £ X \ {x} the tuple {y, y' ,y"} overlaps X — x. All other vertices y £ V \ X are reachable 
from at most one terminal in G — X, and from at most two terminals in G — (X — x) (where the 
second path goes through x). On the other hand, we just showed that {x,x',x"} extends X — x, 
and is thus contained in the representative set. □ 
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8.4 Omitted proofs of Section 15.21 



Lemma [6j Let v be a vertex which is not highly reachable under any multiway cut of size at most k 
and is not contained in N(T). Then there is an optimal multiway cut which does not contain v. 

Proof. Let X be an optimal multiway cut for G with v G X. Let t £ T such that X + v' + N{t) is 
not independent in (G',N(T)), according to Definition [2J It follows that there is a minimal cut C 
of size at most \X\ + |iV(i)| separating N(T) and X + v' + N(t). Clearly, C must contain N(t), so 
let X' := {XnN(t))U(C\N(t)). Then \X'\ < \X\: we have \C\ < \XUN{t)\ = \X\ + \N(t) \ X\, 
and \X'\ = \C\ -\N(t) \X\ (by choice). We show that (1) X' is a multiway cut for G and (2) v £ X' . 

(1) Assume otherwise and let P be a path connecting two terminals U,tj in G — X' with £j ^ tj\ 
w.l.o.g. ti ^ t. It follows that P contains at least one vertex of the multiway cut X. We know 
that C = X'L)N(t) separates ti from X. Since X' does not intersect P, it follows that C intersects P 
only in N(t) \ X. But then a prefix of P is a path from ti to t which avoids X, a contradiction. 

(2) Assume for contradiction that v G X' . By minimality of C it follows that some neighbor 
of v is reachable from T in G — C, implying that v' must be in C too. From minimality of X we 
have that X + N(t) is independent, implying \X \ N(t)\ vertex-disjoint paths Pi from N(T — t) 
to X \ N(t) avoiding N(t). Thus C \ N(t) intersects all these paths Pi; however, since v' is a sink- 
only copy, it cannot intersect any of the paths, and since v £ N(t) we have \C\ N(t)\ > \X\ N(t)\. 
But since C n N(t) = X n N(t), this contradicts the size of C. □ 

Theorem [9l s-Multiway CiJT(fc) has a randomized kernel of 0(k s+1 ) vertices. The error prob- 
ability can be made exponentially small in n, and errors are limited to false negatives. 

Proof. We begin with the correctness of Reduction Rule [TJ 

Reduction Rule[TJ, Let {G = (V,E),T,k) be a multiway cut instance, and let v £ V \ (T U N(T)) 
be a vertex which is not identified as potentially highly reachable by Lemma\M Create G' from G 
by replacing N(v) by a clique and removing v from the graph. Return (G',T,k). 

Claim 8. The instance (G',T,k) produced by Rule\^is equivalent to (G,T,k), and the rule can be 
performed exhaustively in polynomial time. 

Proof. We begin by showing correctness of the replacement. For soundness, assume that the optimal 
solution for G has size k, and let X' be a multiway cut for G' of size at most k — 1. Clearly, X' + v 
is a multiway cut for G. However, by minimality of the solution, there exists a path between two 
terminals which only intersects X' + v in v. This path still exists in G' and does not intersect X' , 
which contradicts X' being a solution. For completeness, by Lemma[6]we have an optimal multiway 
cut X for G with v ^ X. Assume that X fails to be a multiway cut in G' , i.e., there is a path 
between two terminals in G' avoiding X. But then there is a corresponding path in G which only 
adds the vertex v, and thus is still disjoint from X. 

Finally, we can perform the rule exhaustively by n repetitions of the cycle of detecting highly 
reachable vertices and removing one irrelevant vertex. □ 

By this, as long as not all vertices are identified as potentially highly reachable, we may (in 
polynomial time) apply a reduction rule decreasing |V(G) \(TU N(T)\. By our representativeness 
condition, at most 0{k s+l ) vertices are identified as potentially highly reachable, and the check 
can be performed in polynomial time. At most 4A; additional vertices remain in T U N{T). 
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Randomness is only required for getting a representation of a gammoid; the failure risk can be 
made exponentially small in the total input length with a polynomial running time (via Theorem [5]), 
without affecting the kernel size. One-sidedness of the error follows since applications of Reduction 
Rule Q] cannot make an instance easier. □ 

8.5 Omitted proofs of Section 15.31 

Theorem [3j Let G = (V, E) be a digraph and let S,T C V . Let r denote the size of a mini- 
mum (S,T)-vertex cut (which may intersect S andT). There exists a set Z C V , \Z\ = 0{\S\-\T\-r), 
such that for any ACS and B C T, it holds that Z contains a minimum (A, B)-vertex cut. We 
can find such a set in randomized polynomial time with failure probability 0{2~ n ). 

Proof. Let G, S, T be as given; we may assume w.l.o.g. that S are sources and T sinks in the graph 
(by adding source vertices before S and sink vertices after T; this does not modify any cuts). Let 
a vertex v G V be essential if there are sets A C S, B C T such that every minimum (A, B)- 
vertex cut contains v, and irrelevant otherwise. Let A C S and B C T, and let Ca resp. Cb be 
the minimum (A, B)-veitex cut closest to A resp. to B. The idea of the proof is that vertices 
essential for an (A, £>)-cut are exactly those in Ca nCg, and, in turn, by Proposition [fl we can use 
representative sets to find such vertices. 

Claim [H Let r denote the size of a minimum (S,T)-vertex cut (which may intersect S and T). 
We can find a set Z C V of 0(\S\ ■ \T\ ■ r) vertices which includes all essential vertices. 

Proof. We will create a matroid M with three layers as follows. Let Mq be the uniform matroid 
of rank r on V, i.e., Mq = (V, (<„))■ Create a graph G' = (V U V',E') from G by adding a 
sink-only copy v 1 of every v E V, and let Mi be the gammoid (G',S). Finally, create a graph G" 
from G by first reversing every edge, then adding a sink-only copy of every vertex, and let M 2 be 
the gammoid (G",T). Let M be the matroid created by a direct sum of Mq, Mi, and M2; for a 
vertex v E V, denote its copy in Mq by v(0), its two copies in Mi by v(l) and v'(l), and its two 
copies in M 2 by v{2) and v'{2). Form a set of tuples by T = {(v(0), v'(l), v'(2)) : v E V\(SUT)}. 
For a vertex v, let T(v) = (v(0), v'(l), v'(2)). Let T* C T be a representative set for T in M, and 
let V* = {v E V : T(v) E 7~*}. We show that every essential vertex is contained in V* U S U T. 

For this, let A C S and BCT, and let C be a minimum (^4, Z?)-vertex cut. Let v E V \ (S U T) 
be a vertex which is a member of every minimum (^4, £?)-vertex cut. Consider the set C consisting 
of the copies of (C — v) in Mq, the copies of (S \ A) U C in Mi, and the copies of (T \ B) U C 
in M 2 . By assumption, G is independent. We will show that, T(v) is the unique tuple in T which 
can extend C . 

Let Ca be the unique minimum (^4, i?)-vertex cut closest to A; pushing (S \ A) U C to a set 
closest to S yields (S \ A) U Ca- Thus, by Prop. CD (and using that S are source- vertices only), 
an element u'(l) extends C if and only if u' is reachable from A in G' — Ca- The situation with 
respect to T is symmetric (letting Cb be the min-cut closest to B). Thus, if T(u) extends C for 
some T{u) E 7", then v! is reachable both from A and from B; since C is a cut, we get u E Ca^Cb- 
But this implies u E C, and the only vertex u £ C such that u(0) extends C 1 is v. Thus T(-y) uniquely 
extends C, and ?; (and by extension, every essential vertex of V \ (S U T)) must be included in V*. 

Finally, the size of Z would by a naive application of Lemma[T]be 0((r + |5| + |T|) 3 ); however, 
an inspection of the proof of Lemma [1] given in [28] reveals that a bound of 0(15"! • \T\ ■ r) suffices, 
since the elements of T(v) are taken from different parts of a direct sum matroid. □ 
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Now, any vertex in V \ (Z U S U T) may safely be made undeletable (as in Reduction Rule [T]) 
without changing the size of any minimum (^4, .B)-vertex cut. By induction, this gives a set of 0(15*1 • 
\T\ ■ r) vertices which covers a minimum (A, B)-vertex cut in G for every A C S, B C T, as 
requested. □ 

Corollary [H Let G = (V, E) be a directed graph, and X C V a set of terminals. We can identify, 
in polynomial time, a set Z ofO(\X\ s ) vertices such that for any S, T, R C X, a minimum (S,T)- 
vertex cut in G — R is contained in Z. 

Proof. Create a graph G' by adding a source-only copy x~ for every x E X. For a set A C X, 
let A~ = {x~ : x 6 ^4}. Compute a set of vertices Z as in Theorem [3J using S = X~ and T = X. 
We show that the vertices of Z work as needed. 

For that, let S,T,R C X, and let A = S~ U R~ and B = T U R. We show a correspondence 
between minimum (^4, 5)-cuts in G' and minimum [S, T)-cuts in G—R. Let C be a minimum (S", T)- 
vertex cut in G — R, and let C' = CUR; it is clear that C' is an {A, 5)-vertex cut in Conversely, 
let C' be a minimum (A, B)-vertex cut in G' . Then C := C' \ (R~ U R) is a valid (S, T)-cut 
in Cr — R: Assume otherwise. Then there is a path from S to T avoiding C U R, which also 
avoids C' C C U R U -R~. By minimality of C 1 , for every x £ R, exactly one of x~ and x are 
contained in C'; thus \C' n (i?U = |J?|. 

We find that C' \(RU R~) is a minimum (S 1 , T)-vertex cut in G — R, for any minimum (A, B)- 
vertex cut C' , one of which will be covered by Z. This finishes our proof. □ 

Corollary [2J Digraph Pair Cut has a kernel of 0(k 4 ) vertices; Almost 2-SAT has a kernel 
of 0(k 6 ) variables. The signed graph generalizations of Odd Cycle Transversal and Edge 
Bipartization, where edges are marked as even or odd and the goal is to kill all odd-parity cycles, 
have kernels with 0(fe 4,5 ) respectively (D(k s ) vertices. All the above kernels are randomized, and 
can be derandomized with polynomial advice. 

Proof. The kernels for Digraph Pair Cut and Almost 2-SAT are most immediate. For Di- 
graph Pair Cut, first reduce the pairs to a representative set of 0(k 2 ) pairs, then apply the above 
with k + 1 copies of the source vertex as set S, and all vertices occurring in the remaining pairs 
as set T. Since the flow is bounded by \S\, we get 0(k 4 ) vertices. For Almost 2-SAT, we get a 
kernel since the Digraph Pair Cut instance resulting from the reductions is a valid instance of 
Almost 2-SAT. 

For Odd Cycle Transversal, first consider the compression form. Given a set X which 
is a valid solution, we first identify a set Z of 0(|X| 3 ) vertices which covers an optimal solution 
for the resulting terminal cut problem in an auxiliary graph G' . It follows from the proof of 
correctness of the Odd Cycle Transversal algorithm (see also [25]) that Z covers a minimum 
odd cycle transversal of the original graph G. Thus, for every path connecting u,w € Z using only 
vertices v £ (V \ Z), add an even or odd edge uw according to the parity of the path. The odd 
cycle transversal contained in Z is still valid, and we certainly do not make the problem easier by 
reducing down to Z. 

For Edge Bipartization, first use the approximation algorithm of Avidor and Langberg [2], 
providing an approximation ratio of O(logOPT). Let X be the returned solution. If \X\ > 
max(A;o, ck log k) for some constants k$, c, then we know that OPT is greater than k and may reject 
the instance (or output a dummy negative instance); otherwise |X| = 0{k\ogk). We may now 
transform the instance into a vertex deletion instance (by subdividing edges and turning vertices 
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into many parallel copies), in which case X becomes an odd cycle transversal for the resulting OCT 
instance (the size of X and the parameter k stay the same). Proceed as for OCT, producing a 
cut-covering set Z with 0(\X\ 3 ) = (D(k 3 ) vertices. We may assume (by the way the reduction into 
an OCT instance was performed) that every vertex in Z corresponds to an edge in the original 
instance. Let G = (V, E) be the original graph, and Ez Q E the set of edges chosen in Z; call Ez 
the marked edges. We assume (by correctness of the construction of Z) that the unmarked edges 
span no odd cycle. Finally, we produce a kernel with edge set Ez as follows. First, partition V 
according to components which are connected using only unmarked edges; since we will not have to 
delete any such edge, each such component can be collapsed into two vertices (a "negative" and a 
"positive" vertex). Then, we may further identify the two vertices in each component, keeping the 
positive, and transferring edges Ez intersecting a negative vertex into edges on positive vertices 
only, modifying the sign of the edge as necessary. This finishes the kernel. □ 

Theorem [4l Let G = (V, E) be an undirected graph and X C V . For any s, there exists a 
set Z C V, \Z\ = 0(\X\ S+1 ), such that for any partition X = (X±, . . . ,X S ) with pairwise disjoint 
subsets of X, it holds that Z contains a minimum multiway cut of X (i.e., a minimum cut C such 
that no pairs of sets X^Xj are connected to each other in G — C). We can find such a set in 
randomized polynomial time with failure probability 0(2~ n ). 

Proof. The proof will again be an irrelevant vertex construction, using the representative set condi- 
tion of Lemma[5]as guide. Call a vertex v G (V\X) essential if there is a partition X = (X\, . . . , X s ) 
of X' C X such that every minimum multiway cut of X contains v. As in Section f5. 21 we find that 
such an essential vertex must be highly reachable under the partition X. Essentially, what remains 
to be proven is that Lemma [5] does not really depend on knowing the partition X in advance. 

Claim [2]. We can identify (in polynomial time) a set of 0(\X\ S+1 ) vertices which includes all 
essential vertices. 

Proof. Let G' be the graph G with a sink-only copy v' added for every v G V, and a private source 
vertex x~ added for every x £ X. For a set iCI, let A~ = \x~ : x G ^4}. We create a gammoid 
as in Lemma O as a direct sum of s + 1 layers, where layer is the uniform matroid of rank \X\ onV, 
and layers 1 through s are copies of the gammoid (G',X~). For a vertex v, let v(i) (resp. v'(i)) 
denote the copy of v (resp. the sink-only copy of v) in layer i, and let T(v) = (v(0), v'(l), . . . , v'(s)). 
Let T = {T{v) : v G V\X}. Let T* be a representative set for T, and let V* = {v G V : T(v) G T*}. 
We show that every essential vertex is contained m V* U X . 

Let X = (X\, . . . , X s ) be a partition of X' C X, and let v G V\X be essential in X. Let Xq := 
X \ X' . Let C be a multiway cut of X; as was established in Section [5.21 we can saturate C + v' 
from (X 1 \ Xi) for every i G [s]; note that we can saturate a set A from [X' \Xi) if and only if we 
can saturate A U X~ U Xq from X~. For a set A C V, let A(i) = {v(i) : v G ^4}. Consider the set 

s 

6 := (C - v)(0) U |J(C U Xr U X ")(i). 

i=l 

As already argued, T(v) extends C. On the other hand, assume that T(u) extends C for u 7^ v; by 
layer 0, u C. By C, u must be reachable from X' \ Xi for every % G [s] (all other sources are killed 
at X~). But then u is reachable from two parts of X, contradicting that C is a multiway cut. □ 

The rest of the proof now proceeds as for Theorem [3j □ 
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9 Conclusion 



We give powerful new techniques for polynomial kernelization, centered around applications of a 
lemma from matroid theory due to Lovasz [26j and Marx [28]. The resulting tools significantly 
advance the field of kernelization, and imply polynomial kernels for a range of problems, including 
Almost 2-SAT(A:), s-terminal Multiway Cut(/c), and Multicut(A;) with s terminal pairs, for 
constant s, among other results. In particular, we show how the lemma can be applied to find 
irrelevant vertices for graph cut problems. In addition to the aforementioned kernels, this lets us 
find a form of cut-covering sets of small size: given a graph G and terminal set T, we can find a 
set Z, of size polynomial in \T\, such that for every A, B C T, a minimum (A, B)-veitex cut is 
contained in Z. Similarly, for a constant s, we can find a set Z of polynomial size such that for every 
partition of T (or a subset of T) into at most s partitions, a minimum multiway cut of the partition 
is contained in Z. We foresee further applications of these results. Similarly to in [25], our kernels 
are randomized; unlike in [25], they can all be made reduction rule-based. Furthermore, the failure 
probability can be reduced to be exponentially small in the input length, implying non-uniform 
polynomial-time kernelization. 

Despite being randomized, we note that all our kernels are compatible with the lower bound 
framework of Bodlaender et al. [3] and Fortnow and Santhanam [14]; see the discussion in [25] 
regarding coRP-kernels. Similar arguments can be made regarding non-uniform kernels; see [14^ 
Corollary 3.4]. Hence, concrete polynomial upper and lower bounds for the current problems is a 
relevant path of research (see Dell and van Melkebeek [10], Dell and Marx [9], and Hermelin and 
Wu [21]). Research on non-uniform kernels in general is also left as future work. 

Further significant open questions include the existence of polynomial kernels for the general 
form of Multiway Cut(/c) and Multicut (s+k), in edge- and vertex-deletion variants, and for the 
Group Feedback Arc Set(/c) and Group Feedback Vertex SET(fc) problems with arbitrary 
groups. Additionally, a polynomial kernel for Directed Feedback Vertex Set remains an open 
problem. 
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